Efficient representation of throat microphone speech
نویسندگان
چکیده
The objective of this work is to represent the information in the speech signal picked up by a throat microphone (TM) in an efficient manner in terms of number of bits required. Since the TM signal is unaffected by ambient noise, it is possible to extract the required information effectively under different environmental conditions. A spectral mapping technique is proposed from the TM speech to normal microphone (NM) speech to improve the perceptual quality. The mapping is done using vector quantization of pairwise spectral feature vectors derived from each frame of TM and the corresponding NM speech signals. Once the codebook is formed, the spectral features from a TM signal are represented as a sequence of codebook indices. The sequence of codebook indices, the pitch contour and the energy contour derived from the TM signal are used to store/transmit the TM speech information efficiently. From the received sequence of codebook indices, the NM spectral vectors are retrieved due to pairwise vector quantization of the feature vectors. A synthetic residual signal is generated at the receiver from prestored residual templates by incorporating the pitch and the energy. The synthetic residual signal is used to excite the system corresponding to the NM spectral vectors to generate the speech signal.
منابع مشابه
An analytic modeling approach to enhancing throat microphone speech commands for keyword spotting
This research was carried out on enhancing throat microphone speech for noise-robust speech keyword spotting. The enhancement was performed by mapping the log-energy in the Mel-frequency bands of throat microphone speech to those of the corresponding close-talk microphone speech. An analytic equation detection system, Eureqa, which can infer nonlinear relations directly from observed data, was ...
متن کاملA new statistical excitation mapping for enhancement of throat microphone recordings
In this paper we investigate a new statistical excitation mapping technique to enhance throat-microphone speech using joint analysis of throatand acoustic-microphone recordings. In a recent study we employed source-filter decomposition to enhance spectral envelope of the throat-microphone recordings. In the source-filter decomposition framework we observed that the spectral envelope difference ...
متن کاملMapping Speech Spectra from Throat Microphone to Close-Speaking Microphone: A Neural Network Approach
Speech recorded from a throat microphone is robust to the surrounding noise, but sounds unnatural unlike the speech recorded from a close-speaking microphone. This paper addresses the issue of improving the perceptual quality of the throat microphone speech by mapping the speech spectra from the throat microphone to the close-speaking microphone. A neural network model is used to capture the sp...
متن کاملCombination of standard and throat microphones for robust speech recognition in highly noisy environments
We present a method to combine standard and throat microphone signals for noise-robust speech recognition. Our approach is to extend the probabilistic optimum filter (POF) mapping algorithm to estimate standard microphone clean speech feature vectors from both microphones’ noisy speech feature vectors. We tested the proposed approach in two noisy speech recognition tasks. In the first task we u...
متن کاملRobust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech
Accuracy of automatic speaker recognition (ASV) systems degrades severely in the presence of background noise. In this paper, we study the use of additional side information provided by a body-conducted sensor, throat microphone. Throat microphone signal is much less affected by background noise in comparison to acoustic microphone signal. This makes throat microphones potentially useful for fe...
متن کامل